Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 371528 |
| Missing cells | 184008 |
| Missing cells (%) | 2.5% |
| Duplicate rows | 4 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 56.7 MiB |
| Average record size in memory | 160.0 B |
Variable types
| DateTime | 3 |
|---|---|
| Text | 2 |
| Categorical | 9 |
| Numeric | 6 |
nrOfPictures has constant value "" | Constant |
| Dataset has 4 (< 0.1%) duplicate rows | Duplicates |
seller is highly imbalanced (> 99.9%) | Imbalance |
offerType is highly imbalanced (99.9%) | Imbalance |
fuelType is highly imbalanced (62.6%) | Imbalance |
vehicleType has 37869 (10.2%) missing values | Missing |
gearbox has 20209 (5.4%) missing values | Missing |
model has 20484 (5.5%) missing values | Missing |
fuelType has 33386 (9.0%) missing values | Missing |
notRepairedDamage has 72060 (19.4%) missing values | Missing |
price is highly skewed (γ1 = 578.0590837) | Skewed |
yearOfRegistration is highly skewed (γ1 = 72.13364168) | Skewed |
powerPS is highly skewed (γ1 = 58.19990873) | Skewed |
price has 10778 (2.9%) zeros | Zeros |
powerPS has 40820 (11.0%) zeros | Zeros |
monthOfRegistration has 37675 (10.1%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-17 03:41:42.475863 |
|---|---|
| Analysis finished | 2024-03-17 03:42:01.489279 |
| Duration | 19.01 seconds |
| Software version | ydata-profiling vv4.6.5 |
| Download configuration | config.json |
dateCrawled
Date
| Distinct | 280500 |
|---|---|
| Distinct (%) | 75.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
| Minimum | 2016-03-05 14:06:22 |
|---|---|
| Maximum | 2016-04-07 14:36:58 |
name
Text
| Distinct | 233531 |
|---|---|
| Distinct (%) | 62.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
Length
| Max length | 34700 |
|---|---|
| Median length | 51 |
| Mean length | 31.993328 |
| Min length | 4 |
Characters and Unicode
| Total characters | 11886417 |
|---|---|
| Distinct characters | 142 |
| Distinct categories | 18 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 206644 ? |
|---|---|
| Unique (%) | 55.6% |
Sample
| 1st row | Golf_3_1.6 |
|---|---|
| 2nd row | A5_Sportback_2.7_Tdi |
| 3rd row | Jeep_Grand_Cherokee_"Overland" |
| 4th row | GOLF_4_1_4__3TÜRER |
| 5th row | Skoda_Fabia_1.4_TDI_PD_Classic |
| Value | Count | Frequency (%) |
| opel_corsa | 818 | 0.2% |
| ford_fiesta | 779 | 0.2% |
| bmw_318i | 632 | 0.2% |
| volkswagen_golf_1.4 | 605 | 0.2% |
| renault_twingo | 585 | 0.2% |
| opel_corsa_b | 534 | 0.1% |
| bmw_316i | 531 | 0.1% |
| bmw_320i | 494 | 0.1% |
| volkswagen_polo | 494 | 0.1% |
| opel_astra | 462 | 0.1% |
| Other values (223930) | 366783 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 1861882 | 15.7% |
| e | 799031 | 6.7% |
| a | 598077 | 5.0% |
| o | 503656 | 4.2% |
| i | 486921 | 4.1% |
| t | 463512 | 3.9% |
| r | 441007 | 3.7% |
| n | 434666 | 3.7% |
| l | 350646 | 2.9% |
| u | 331886 | 2.8% |
| Other values (132) | 5615133 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6320607 | |
| Uppercase Letter | 2299790 | 19.3% |
| Connector Punctuation | 1861882 | 15.7% |
| Decimal Number | 1093609 | 9.2% |
| Other Punctuation | 297037 | 2.5% |
| Math Symbol | 8484 | 0.1% |
| Control | 1927 | < 0.1% |
| Dash Punctuation | 1776 | < 0.1% |
| Space Separator | 888 | < 0.1% |
| Modifier Symbol | 228 | < 0.1% |
| Other values (8) | 189 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 799031 | |
| a | 598077 | 9.5% |
| o | 503656 | 8.0% |
| i | 486921 | 7.7% |
| t | 463512 | 7.3% |
| r | 441007 | 7.0% |
| n | 434666 | 6.9% |
| l | 350646 | 5.5% |
| u | 331886 | 5.3% |
| s | 315187 | 5.0% |
| Other values (30) | 1596018 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 211682 | 9.2% |
| A | 182644 | 7.9% |
| V | 162167 | 7.1% |
| S | 158031 | 6.9% |
| C | 149991 | 6.5% |
| M | 133523 | 5.8% |
| D | 126654 | 5.5% |
| P | 113542 | 4.9% |
| I | 113151 | 4.9% |
| B | 108493 | 4.7% |
| Other values (28) | 839912 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 191529 | |
| / | 37327 | 12.6% |
| ! | 35267 | 11.9% |
| * | 13391 | 4.5% |
| , | 5624 | 1.9% |
| " | 4520 | 1.5% |
| & | 2996 | 1.0% |
| : | 2854 | 1.0% |
| ? | 2297 | 0.8% |
| ; | 618 | 0.2% |
| Other values (7) | 614 | 0.2% |
Control
| Value | Count | Frequency (%) |
| € | 1327 | |
| 296 | 15.4% | |
| • | 144 | 7.5% |
| – | 96 | 5.0% |
| “ | 35 | 1.8% |
| „ | 15 | 0.8% |
| Â… | 6 | 0.3% |
| ” | 3 | 0.2% |
| Š | 2 | 0.1% |
| 1 | 0.1% | |
| Other values (2) | 2 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 236238 | |
| 0 | 212743 | |
| 2 | 172877 | |
| 6 | 98251 | |
| 3 | 93274 | 8.5% |
| 4 | 77150 | 7.1% |
| 5 | 63547 | 5.8% |
| 8 | 55672 | 5.1% |
| 7 | 44172 | 4.0% |
| 9 | 39685 | 3.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 6429 | |
| | | 947 | 11.2% |
| ~ | 490 | 5.8% |
| > | 281 | 3.3% |
| < | 216 | 2.5% |
| = | 69 | 0.8% |
| × | 52 | 0.6% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 137 | |
| ` | 73 | |
| ^ | 18 | 7.9% |
Other Number
| Value | Count | Frequency (%) |
| ³ | 15 | |
| ² | 6 | 27.3% |
| ½ | 1 | 4.5% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 45 | |
| ® | 3 | 6.2% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 34 | |
| ¥ | 1 | 2.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1861882 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1776 |
Space Separator
| Value | Count | Frequency (%) |
| 888 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 32 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 32 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 9 |
Format
| Value | Count | Frequency (%) |
| Â | 7 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8620397 | |
| Common | 3266020 | 27.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 799031 | 9.3% |
| a | 598077 | 6.9% |
| o | 503656 | 5.8% |
| i | 486921 | 5.6% |
| t | 463512 | 5.4% |
| r | 441007 | 5.1% |
| n | 434666 | 5.0% |
| l | 350646 | 4.1% |
| u | 331886 | 3.9% |
| s | 315187 | 3.7% |
| Other values (68) | 3895808 |
Common
| Value | Count | Frequency (%) |
| _ | 1861882 | |
| 1 | 236238 | 7.2% |
| 0 | 212743 | 6.5% |
| . | 191529 | 5.9% |
| 2 | 172877 | 5.3% |
| 6 | 98251 | 3.0% |
| 3 | 93274 | 2.9% |
| 4 | 77150 | 2.4% |
| 5 | 63547 | 1.9% |
| 8 | 55672 | 1.7% |
| Other values (54) | 202857 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11850215 | |
| None | 36202 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 1861882 | 15.7% |
| e | 799031 | 6.7% |
| a | 598077 | 5.0% |
| o | 503656 | 4.3% |
| i | 486921 | 4.1% |
| t | 463512 | 3.9% |
| r | 441007 | 3.7% |
| n | 434666 | 3.7% |
| l | 350646 | 3.0% |
| u | 331886 | 2.8% |
| Other values (83) | 5578931 |
None
| Value | Count | Frequency (%) |
| Ü | 29724 | |
| ë | 2885 | 8.0% |
| € | 1327 | 3.7% |
| é | 937 | 2.6% |
| Ä | 361 | 1.0% |
| Ö | 234 | 0.6% |
| • | 144 | 0.4% |
| ´ | 137 | 0.4% |
| – | 96 | 0.3% |
| × | 52 | 0.1% |
| Other values (39) | 305 | 0.8% |
seller
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
| privat | |
|---|---|
| gewerblich | 3 |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.0000323 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2229180 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | privat |
|---|---|
| 2nd row | privat |
| 3rd row | privat |
| 4th row | privat |
| 5th row | privat |
Common Values
| Value | Count | Frequency (%) |
| privat | 371525 | |
| gewerblich | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| privat | 371525 | |
| gewerblich | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 371528 | |
| i | 371528 | |
| p | 371525 | |
| v | 371525 | |
| a | 371525 | |
| t | 371525 | |
| e | 6 | < 0.1% |
| g | 3 | < 0.1% |
| w | 3 | < 0.1% |
| b | 3 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2229180 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 371528 | |
| i | 371528 | |
| p | 371525 | |
| v | 371525 | |
| a | 371525 | |
| t | 371525 | |
| e | 6 | < 0.1% |
| g | 3 | < 0.1% |
| w | 3 | < 0.1% |
| b | 3 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2229180 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 371528 | |
| i | 371528 | |
| p | 371525 | |
| v | 371525 | |
| a | 371525 | |
| t | 371525 | |
| e | 6 | < 0.1% |
| g | 3 | < 0.1% |
| w | 3 | < 0.1% |
| b | 3 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2229180 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 371528 | |
| i | 371528 | |
| p | 371525 | |
| v | 371525 | |
| a | 371525 | |
| t | 371525 | |
| e | 6 | < 0.1% |
| g | 3 | < 0.1% |
| w | 3 | < 0.1% |
| b | 3 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
offerType
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
| Angebot | |
|---|---|
| Gesuch | 12 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.9999677 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2600684 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Angebot |
|---|---|
| 2nd row | Angebot |
| 3rd row | Angebot |
| 4th row | Angebot |
| 5th row | Angebot |
Common Values
| Value | Count | Frequency (%) |
| Angebot | 371516 | |
| Gesuch | 12 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| angebot | 371516 | |
| gesuch | 12 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 371528 | |
| A | 371516 | |
| n | 371516 | |
| g | 371516 | |
| b | 371516 | |
| o | 371516 | |
| t | 371516 | |
| G | 12 | < 0.1% |
| s | 12 | < 0.1% |
| u | 12 | < 0.1% |
| Other values (2) | 24 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2229156 | |
| Uppercase Letter | 371528 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 371528 | |
| n | 371516 | |
| g | 371516 | |
| b | 371516 | |
| o | 371516 | |
| t | 371516 | |
| s | 12 | < 0.1% |
| u | 12 | < 0.1% |
| c | 12 | < 0.1% |
| h | 12 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 371516 | |
| G | 12 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2600684 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 371528 | |
| A | 371516 | |
| n | 371516 | |
| g | 371516 | |
| b | 371516 | |
| o | 371516 | |
| t | 371516 | |
| G | 12 | < 0.1% |
| s | 12 | < 0.1% |
| u | 12 | < 0.1% |
| Other values (2) | 24 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2600684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 371528 | |
| A | 371516 | |
| n | 371516 | |
| g | 371516 | |
| b | 371516 | |
| o | 371516 | |
| t | 371516 | |
| G | 12 | < 0.1% |
| s | 12 | < 0.1% |
| u | 12 | < 0.1% |
| Other values (2) | 24 | < 0.1% |
price
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 5597 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17295.142 |
| Minimum | 0 |
|---|---|
| Maximum | 2.1474836 × 109 |
| Zeros | 10778 |
| Zeros (%) | 2.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 200 |
| Q1 | 1150 |
| median | 2950 |
| Q3 | 7200 |
| 95-th percentile | 19790 |
| Maximum | 2.1474836 × 109 |
| Range | 2.1474836 × 109 |
| Interquartile range (IQR) | 6050 |
Descriptive statistics
| Standard deviation | 3587953.7 |
|---|---|
| Coefficient of variation (CV) | 207.45443 |
| Kurtosis | 345433.32 |
| Mean | 17295.142 |
| Median Absolute Deviation (MAD) | 2200 |
| Skewness | 578.05908 |
| Sum | 6.4256295 × 109 |
| Variance | 1.2873412 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 10778 | 2.9% |
| 500 | 5670 | 1.5% |
| 1500 | 5394 | 1.5% |
| 1000 | 4649 | 1.3% |
| 1200 | 4594 | 1.2% |
| 2500 | 4438 | 1.2% |
| 600 | 3819 | 1.0% |
| 3500 | 3792 | 1.0% |
| 800 | 3784 | 1.0% |
| 2000 | 3432 | 0.9% |
| Other values (5587) | 321178 |
| Value | Count | Frequency (%) |
| 0 | 10778 | |
| 1 | 1189 | 0.3% |
| 2 | 12 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 26 | < 0.1% |
| 7 | 3 | < 0.1% |
| 8 | 9 | < 0.1% |
| 9 | 8 | < 0.1% |
| 10 | 84 | < 0.1% |
| Value | Count | Frequency (%) |
| 2147483647 | 1 | < 0.1% |
| 99999999 | 15 | |
| 99000000 | 1 | < 0.1% |
| 74185296 | 1 | < 0.1% |
| 32545461 | 1 | < 0.1% |
| 27322222 | 1 | < 0.1% |
| 14000500 | 1 | < 0.1% |
| 12345678 | 9 | |
| 11111111 | 10 | |
| 10010011 | 1 | < 0.1% |
abtest
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
| test | |
|---|---|
| control |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 5.4449221 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2022941 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | test |
|---|---|
| 2nd row | test |
| 3rd row | test |
| 4th row | test |
| 5th row | test |
Common Values
| Value | Count | Frequency (%) |
| test | 192585 | |
| control | 178943 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| test | 192585 | |
| control | 178943 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 564113 | |
| o | 357886 | |
| e | 192585 | 9.5% |
| s | 192585 | 9.5% |
| c | 178943 | 8.8% |
| n | 178943 | 8.8% |
| r | 178943 | 8.8% |
| l | 178943 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2022941 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 564113 | |
| o | 357886 | |
| e | 192585 | 9.5% |
| s | 192585 | 9.5% |
| c | 178943 | 8.8% |
| n | 178943 | 8.8% |
| r | 178943 | 8.8% |
| l | 178943 | 8.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2022941 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 564113 | |
| o | 357886 | |
| e | 192585 | 9.5% |
| s | 192585 | 9.5% |
| c | 178943 | 8.8% |
| n | 178943 | 8.8% |
| r | 178943 | 8.8% |
| l | 178943 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2022941 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 564113 | |
| o | 357886 | |
| e | 192585 | 9.5% |
| s | 192585 | 9.5% |
| c | 178943 | 8.8% |
| n | 178943 | 8.8% |
| r | 178943 | 8.8% |
| l | 178943 | 8.8% |
vehicleType
Categorical
MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 37869 |
| Missing (%) | 10.2% |
| Memory size | 2.8 MiB |
| limousine | |
|---|---|
| kleinwagen | |
| kombi | |
| bus | |
| cabrio | |
| Other values (3) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.1582814 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2388425 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | coupe |
|---|---|
| 2nd row | suv |
| 3rd row | kleinwagen |
| 4th row | kleinwagen |
| 5th row | limousine |
Common Values
| Value | Count | Frequency (%) |
| limousine | 95894 | |
| kleinwagen | 80023 | |
| kombi | 67564 | |
| bus | 30201 | 8.1% |
| cabrio | 22898 | 6.2% |
| coupe | 19015 | 5.1% |
| suv | 14707 | 4.0% |
| andere | 3357 | 0.9% |
| (Missing) | 37869 | 10.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| limousine | 95894 | |
| kleinwagen | 80023 | |
| kombi | 67564 | |
| bus | 30201 | 9.1% |
| cabrio | 22898 | 6.9% |
| coupe | 19015 | 5.7% |
| suv | 14707 | 4.4% |
| andere | 3357 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 362273 | |
| e | 281669 | |
| n | 259297 | |
| o | 205371 | |
| l | 175917 | |
| m | 163458 | |
| u | 159817 | |
| k | 147587 | 6.2% |
| s | 140802 | 5.9% |
| b | 120663 | 5.1% |
| Other values (8) | 371571 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2388425 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 362273 | |
| e | 281669 | |
| n | 259297 | |
| o | 205371 | |
| l | 175917 | |
| m | 163458 | |
| u | 159817 | |
| k | 147587 | 6.2% |
| s | 140802 | 5.9% |
| b | 120663 | 5.1% |
| Other values (8) | 371571 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2388425 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 362273 | |
| e | 281669 | |
| n | 259297 | |
| o | 205371 | |
| l | 175917 | |
| m | 163458 | |
| u | 159817 | |
| k | 147587 | 6.2% |
| s | 140802 | 5.9% |
| b | 120663 | 5.1% |
| Other values (8) | 371571 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2388425 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 362273 | |
| e | 281669 | |
| n | 259297 | |
| o | 205371 | |
| l | 175917 | |
| m | 163458 | |
| u | 159817 | |
| k | 147587 | 6.2% |
| s | 140802 | 5.9% |
| b | 120663 | 5.1% |
| Other values (8) | 371571 |
yearOfRegistration
Real number (ℝ)
SKEWED 
| Distinct | 155 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2004.578 |
| Minimum | 1000 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 1992 |
| Q1 | 1999 |
| median | 2003 |
| Q3 | 2008 |
| 95-th percentile | 2016 |
| Maximum | 9999 |
| Range | 8999 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 92.866598 |
|---|---|
| Coefficient of variation (CV) | 0.046327256 |
| Kurtosis | 5667.8597 |
| Mean | 2004.578 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 72.133642 |
| Sum | 7.4475685 × 108 |
| Variance | 8624.2049 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2000 | 24551 | 6.6% |
| 1999 | 22767 | 6.1% |
| 2005 | 22316 | 6.0% |
| 2006 | 20230 | 5.4% |
| 2001 | 20218 | 5.4% |
| 2003 | 19873 | 5.3% |
| 2004 | 19746 | 5.3% |
| 2002 | 19189 | 5.2% |
| 1998 | 17951 | 4.8% |
| 2007 | 17673 | 4.8% |
| Other values (145) | 167014 |
| Value | Count | Frequency (%) |
| 1000 | 38 | |
| 1001 | 1 | < 0.1% |
| 1039 | 1 | < 0.1% |
| 1111 | 4 | < 0.1% |
| 1200 | 1 | < 0.1% |
| 1234 | 4 | < 0.1% |
| 1253 | 1 | < 0.1% |
| 1255 | 1 | < 0.1% |
| 1300 | 2 | < 0.1% |
| 1400 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 27 | |
| 9996 | 1 | < 0.1% |
| 9450 | 1 | < 0.1% |
| 9229 | 1 | < 0.1% |
| 9000 | 5 | < 0.1% |
| 8888 | 2 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8455 | 1 | < 0.1% |
| 8200 | 1 | < 0.1% |
| 8000 | 2 | < 0.1% |
gearbox
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20209 |
| Missing (%) | 5.4% |
| Memory size | 2.8 MiB |
| manuell | |
|---|---|
| automatik |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.4389458 |
| Min length | 7 |
Characters and Unicode
| Total characters | 2613443 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | manuell |
|---|---|
| 2nd row | manuell |
| 3rd row | automatik |
| 4th row | manuell |
| 5th row | manuell |
Common Values
| Value | Count | Frequency (%) |
| manuell | 274214 | |
| automatik | 77105 | 20.8% |
| (Missing) | 20209 | 5.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| manuell | 274214 | |
| automatik | 77105 | 21.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 548428 | |
| a | 428424 | |
| m | 351319 | |
| u | 351319 | |
| n | 274214 | |
| e | 274214 | |
| t | 154210 | 5.9% |
| o | 77105 | 3.0% |
| i | 77105 | 3.0% |
| k | 77105 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2613443 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 548428 | |
| a | 428424 | |
| m | 351319 | |
| u | 351319 | |
| n | 274214 | |
| e | 274214 | |
| t | 154210 | 5.9% |
| o | 77105 | 3.0% |
| i | 77105 | 3.0% |
| k | 77105 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2613443 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 548428 | |
| a | 428424 | |
| m | 351319 | |
| u | 351319 | |
| n | 274214 | |
| e | 274214 | |
| t | 154210 | 5.9% |
| o | 77105 | 3.0% |
| i | 77105 | 3.0% |
| k | 77105 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2613443 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 548428 | |
| a | 428424 | |
| m | 351319 | |
| u | 351319 | |
| n | 274214 | |
| e | 274214 | |
| t | 154210 | 5.9% |
| o | 77105 | 3.0% |
| i | 77105 | 3.0% |
| k | 77105 | 3.0% |
powerPS
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 794 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 115.54948 |
| Minimum | 0 |
|---|---|
| Maximum | 20000 |
| Zeros | 40820 |
| Zeros (%) | 11.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 70 |
| median | 105 |
| Q3 | 150 |
| 95-th percentile | 231 |
| Maximum | 20000 |
| Range | 20000 |
| Interquartile range (IQR) | 80 |
Descriptive statistics
| Standard deviation | 192.13958 |
|---|---|
| Coefficient of variation (CV) | 1.6628338 |
| Kurtosis | 4424.2988 |
| Mean | 115.54948 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 58.199909 |
| Sum | 42929866 |
| Variance | 36917.617 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 40820 | 11.0% |
| 75 | 24035 | 6.5% |
| 60 | 15907 | 4.3% |
| 150 | 15442 | 4.2% |
| 140 | 13585 | 3.7% |
| 101 | 13313 | 3.6% |
| 90 | 12748 | 3.4% |
| 116 | 11963 | 3.2% |
| 170 | 10982 | 3.0% |
| 105 | 10429 | 2.8% |
| Other values (784) | 202304 |
| Value | Count | Frequency (%) |
| 0 | 40820 | |
| 1 | 34 | < 0.1% |
| 2 | 10 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 30 | < 0.1% |
| 5 | 103 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 11 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 20000 | 1 | |
| 19312 | 1 | |
| 19211 | 1 | |
| 19208 | 1 | |
| 17932 | 1 | |
| 17700 | 1 | |
| 17410 | 1 | |
| 17322 | 1 | |
| 17019 | 1 | |
| 17011 | 1 |
model
Text
MISSING 
| Distinct | 251 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 20484 |
| Missing (%) | 5.5% |
| Memory size | 2.8 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 5.059719 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1776184 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | golf |
|---|---|
| 2nd row | grand |
| 3rd row | golf |
| 4th row | fabia |
| 5th row | 3er |
| Value | Count | Frequency (%) |
| golf | 30070 | 8.6% |
| andere | 26400 | 7.5% |
| 3er | 20567 | 5.9% |
| polo | 13092 | 3.7% |
| corsa | 12573 | 3.6% |
| astra | 10830 | 3.1% |
| passat | 10306 | 2.9% |
| a4 | 10257 | 2.9% |
| c_klasse | 8775 | 2.5% |
| 5er | 8546 | 2.4% |
| Other values (241) | 199628 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 226087 | |
| e | 214406 | |
| r | 168226 | 9.5% |
| o | 151660 | 8.5% |
| s | 134093 | 7.5% |
| l | 91767 | 5.2% |
| t | 82624 | 4.7% |
| i | 73848 | 4.2% |
| n | 72726 | 4.1% |
| c | 66825 | 3.8% |
| Other values (27) | 493922 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1634246 | |
| Decimal Number | 96705 | 5.4% |
| Connector Punctuation | 45233 | 2.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 226087 | |
| e | 214406 | |
| r | 168226 | |
| o | 151660 | |
| s | 134093 | 8.2% |
| l | 91767 | 5.6% |
| t | 82624 | 5.1% |
| i | 73848 | 4.5% |
| n | 72726 | 4.5% |
| c | 66825 | 4.1% |
| Other values (16) | 351984 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 31582 | |
| 5 | 13242 | |
| 4 | 12748 | |
| 1 | 10488 | 10.8% |
| 6 | 8820 | 9.1% |
| 0 | 7570 | 7.8% |
| 2 | 5611 | 5.8% |
| 7 | 2703 | 2.8% |
| 8 | 2400 | 2.5% |
| 9 | 1541 | 1.6% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 45233 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1634246 | |
| Common | 141938 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 226087 | |
| e | 214406 | |
| r | 168226 | |
| o | 151660 | |
| s | 134093 | 8.2% |
| l | 91767 | 5.6% |
| t | 82624 | 5.1% |
| i | 73848 | 4.5% |
| n | 72726 | 4.5% |
| c | 66825 | 4.1% |
| Other values (16) | 351984 |
Common
| Value | Count | Frequency (%) |
| _ | 45233 | |
| 3 | 31582 | |
| 5 | 13242 | 9.3% |
| 4 | 12748 | 9.0% |
| 1 | 10488 | 7.4% |
| 6 | 8820 | 6.2% |
| 0 | 7570 | 5.3% |
| 2 | 5611 | 4.0% |
| 7 | 2703 | 1.9% |
| 8 | 2400 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1776184 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 226087 | |
| e | 214406 | |
| r | 168226 | 9.5% |
| o | 151660 | 8.5% |
| s | 134093 | 7.5% |
| l | 91767 | 5.2% |
| t | 82624 | 4.7% |
| i | 73848 | 4.2% |
| n | 72726 | 4.1% |
| c | 66825 | 3.8% |
| Other values (27) | 493922 |
kilometer
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 125618.69 |
| Minimum | 5000 |
|---|---|
| Maximum | 150000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 5000 |
|---|---|
| 5-th percentile | 30000 |
| Q1 | 125000 |
| median | 150000 |
| Q3 | 150000 |
| 95-th percentile | 150000 |
| Maximum | 150000 |
| Range | 145000 |
| Interquartile range (IQR) | 25000 |
Descriptive statistics
| Standard deviation | 40112.337 |
|---|---|
| Coefficient of variation (CV) | 0.31931823 |
| Kurtosis | 1.2229142 |
| Mean | 125618.69 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.5515773 |
| Sum | 4.667086 × 1010 |
| Variance | 1.6089996 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 150000 | 240797 | |
| 125000 | 38067 | 10.2% |
| 100000 | 15920 | 4.3% |
| 90000 | 12523 | 3.4% |
| 80000 | 11053 | 3.0% |
| 70000 | 9773 | 2.6% |
| 60000 | 8669 | 2.3% |
| 50000 | 7615 | 2.0% |
| 5000 | 7069 | 1.9% |
| 40000 | 6376 | 1.7% |
| Other values (3) | 13666 | 3.7% |
| Value | Count | Frequency (%) |
| 5000 | 7069 | |
| 10000 | 1949 | 0.5% |
| 20000 | 5676 | |
| 30000 | 6041 | |
| 40000 | 6376 | |
| 50000 | 7615 | |
| 60000 | 8669 | |
| 70000 | 9773 | |
| 80000 | 11053 | |
| 90000 | 12523 |
| Value | Count | Frequency (%) |
| 150000 | 240797 | |
| 125000 | 38067 | 10.2% |
| 100000 | 15920 | 4.3% |
| 90000 | 12523 | 3.4% |
| 80000 | 11053 | 3.0% |
| 70000 | 9773 | 2.6% |
| 60000 | 8669 | 2.3% |
| 50000 | 7615 | 2.0% |
| 40000 | 6376 | 1.7% |
| 30000 | 6041 | 1.6% |
monthOfRegistration
Real number (ℝ)
ZEROS 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.7344453 |
| Minimum | 0 |
|---|---|
| Maximum | 12 |
| Zeros | 37675 |
| Zeros (%) | 10.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.7124123 |
|---|---|
| Coefficient of variation (CV) | 0.64738822 |
| Kurtosis | -1.1428356 |
| Mean | 5.7344453 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.079107888 |
| Sum | 2130507 |
| Variance | 13.782005 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37675 | |
| 3 | 36170 | |
| 6 | 33167 | |
| 4 | 30918 | |
| 5 | 30631 | |
| 7 | 28958 | |
| 10 | 27337 | 7.4% |
| 11 | 25489 | 6.9% |
| 12 | 25380 | 6.8% |
| 9 | 25074 | 6.7% |
| Other values (3) | 70729 |
| Value | Count | Frequency (%) |
| 0 | 37675 | |
| 1 | 24561 | |
| 2 | 22403 | |
| 3 | 36170 | |
| 4 | 30918 | |
| 5 | 30631 | |
| 6 | 33167 | |
| 7 | 28958 | |
| 8 | 23765 | |
| 9 | 25074 |
| Value | Count | Frequency (%) |
| 12 | 25380 | |
| 11 | 25489 | |
| 10 | 27337 | |
| 9 | 25074 | |
| 8 | 23765 | |
| 7 | 28958 | |
| 6 | 33167 | |
| 5 | 30631 | |
| 4 | 30918 | |
| 3 | 36170 |
fuelType
Categorical
IMBALANCE  MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 33386 |
| Missing (%) | 9.0% |
| Memory size | 2.8 MiB |
| benzin | |
|---|---|
| diesel | |
| lpg | 5378 |
| cng | 571 |
| hybrid | 278 |
| Other values (2) | 312 |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.947528 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2011109 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | benzin |
|---|---|
| 2nd row | diesel |
| 3rd row | diesel |
| 4th row | benzin |
| 5th row | diesel |
Common Values
| Value | Count | Frequency (%) |
| benzin | 223857 | |
| diesel | 107746 | |
| lpg | 5378 | 1.4% |
| cng | 571 | 0.2% |
| hybrid | 278 | 0.1% |
| andere | 208 | 0.1% |
| elektro | 104 | < 0.1% |
| (Missing) | 33386 | 9.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| benzin | 223857 | |
| diesel | 107746 | |
| lpg | 5378 | 1.6% |
| cng | 571 | 0.2% |
| hybrid | 278 | 0.1% |
| andere | 208 | 0.1% |
| elektro | 104 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 448493 | |
| e | 439973 | |
| i | 331881 | |
| b | 224135 | |
| z | 223857 | |
| l | 113228 | 5.6% |
| d | 108232 | 5.4% |
| s | 107746 | 5.4% |
| g | 5949 | 0.3% |
| p | 5378 | 0.3% |
| Other values (8) | 2237 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2011109 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 448493 | |
| e | 439973 | |
| i | 331881 | |
| b | 224135 | |
| z | 223857 | |
| l | 113228 | 5.6% |
| d | 108232 | 5.4% |
| s | 107746 | 5.4% |
| g | 5949 | 0.3% |
| p | 5378 | 0.3% |
| Other values (8) | 2237 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2011109 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 448493 | |
| e | 439973 | |
| i | 331881 | |
| b | 224135 | |
| z | 223857 | |
| l | 113228 | 5.6% |
| d | 108232 | 5.4% |
| s | 107746 | 5.4% |
| g | 5949 | 0.3% |
| p | 5378 | 0.3% |
| Other values (8) | 2237 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2011109 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 448493 | |
| e | 439973 | |
| i | 331881 | |
| b | 224135 | |
| z | 223857 | |
| l | 113228 | 5.6% |
| d | 108232 | 5.4% |
| s | 107746 | 5.4% |
| g | 5949 | 0.3% |
| p | 5378 | 0.3% |
| Other values (8) | 2237 | 0.1% |
brand
Categorical
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
| volkswagen | |
|---|---|
| bmw | |
| opel | |
| mercedes_benz | |
| audi | |
| Other values (35) |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 6.7532864 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2509035 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | volkswagen |
|---|---|
| 2nd row | audi |
| 3rd row | jeep |
| 4th row | volkswagen |
| 5th row | skoda |
Common Values
| Value | Count | Frequency (%) |
| volkswagen | 79640 | |
| bmw | 40274 | |
| opel | 40136 | |
| mercedes_benz | 35309 | |
| audi | 32873 | |
| ford | 25573 | 6.9% |
| renault | 17969 | 4.8% |
| peugeot | 11027 | 3.0% |
| fiat | 9676 | 2.6% |
| seat | 7022 | 1.9% |
| Other values (30) | 72029 |
Length
| Value | Count | Frequency (%) |
| volkswagen | 79640 | |
| bmw | 40274 | |
| opel | 40136 | |
| mercedes_benz | 35309 | |
| audi | 32873 | |
| ford | 25573 | 6.9% |
| renault | 17969 | 4.8% |
| peugeot | 11027 | 3.0% |
| fiat | 9676 | 2.6% |
| seat | 7022 | 1.9% |
| Other values (30) | 72029 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 330339 | 13.2% |
| a | 207305 | 8.3% |
| o | 205135 | 8.2% |
| s | 169113 | 6.7% |
| n | 163877 | 6.5% |
| l | 148193 | 5.9% |
| w | 120456 | 4.8% |
| d | 114816 | 4.6% |
| r | 103102 | 4.1% |
| m | 95327 | 3.8% |
| Other values (15) | 851372 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2466629 | |
| Connector Punctuation | 42406 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 330339 | |
| a | 207305 | 8.4% |
| o | 205135 | 8.3% |
| s | 169113 | 6.9% |
| n | 163877 | 6.6% |
| l | 148193 | 6.0% |
| w | 120456 | 4.9% |
| d | 114816 | 4.7% |
| r | 103102 | 4.2% |
| m | 95327 | 3.9% |
| Other values (14) | 808966 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 42406 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2466629 | |
| Common | 42406 | 1.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 330339 | |
| a | 207305 | 8.4% |
| o | 205135 | 8.3% |
| s | 169113 | 6.9% |
| n | 163877 | 6.6% |
| l | 148193 | 6.0% |
| w | 120456 | 4.9% |
| d | 114816 | 4.7% |
| r | 103102 | 4.2% |
| m | 95327 | 3.9% |
| Other values (14) | 808966 |
Common
| Value | Count | Frequency (%) |
| _ | 42406 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2509035 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 330339 | 13.2% |
| a | 207305 | 8.3% |
| o | 205135 | 8.2% |
| s | 169113 | 6.7% |
| n | 163877 | 6.5% |
| l | 148193 | 5.9% |
| w | 120456 | 4.8% |
| d | 114816 | 4.6% |
| r | 103102 | 4.1% |
| m | 95327 | 3.8% |
| Other values (15) | 851372 |
notRepairedDamage
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 72060 |
| Missing (%) | 19.4% |
| Memory size | 2.8 MiB |
| nein | |
|---|---|
| ja |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.7576636 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1125300 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ja |
|---|---|
| 2nd row | nein |
| 3rd row | nein |
| 4th row | ja |
| 5th row | nein |
Common Values
| Value | Count | Frequency (%) |
| nein | 263182 | |
| ja | 36286 | 9.8% |
| (Missing) | 72060 | 19.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| nein | 263182 | |
| ja | 36286 | 12.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 526364 | |
| e | 263182 | |
| i | 263182 | |
| j | 36286 | 3.2% |
| a | 36286 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1125300 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 526364 | |
| e | 263182 | |
| i | 263182 | |
| j | 36286 | 3.2% |
| a | 36286 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1125300 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 526364 | |
| e | 263182 | |
| i | 263182 | |
| j | 36286 | 3.2% |
| a | 36286 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1125300 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 526364 | |
| e | 263182 | |
| i | 263182 | |
| j | 36286 | 3.2% |
| a | 36286 | 3.2% |
dateCreated
Date
| Distinct | 114 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
| Minimum | 2014-03-10 00:00:00 |
|---|---|
| Maximum | 2016-04-07 00:00:00 |
nrOfPictures
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 371528 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 371528 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 371528 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 371528 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 371528 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 371528 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 371528 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 371528 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 371528 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 371528 |
postalCode
Real number (ℝ)
| Distinct | 8150 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50820.668 |
| Minimum | 1067 |
|---|---|
| Maximum | 99998 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.8 MiB |
Quantile statistics
| Minimum | 1067 |
|---|---|
| 5-th percentile | 9661 |
| Q1 | 30459 |
| median | 49610 |
| Q3 | 71546 |
| 95-th percentile | 93133 |
| Maximum | 99998 |
| Range | 98931 |
| Interquartile range (IQR) | 41087 |
Descriptive statistics
| Standard deviation | 25799.082 |
|---|---|
| Coefficient of variation (CV) | 0.50764942 |
| Kurtosis | -0.97577938 |
| Mean | 50820.668 |
| Median Absolute Deviation (MAD) | 20731 |
| Skewness | 0.06188007 |
| Sum | 1.8881301 × 1010 |
| Variance | 6.6559266 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10115 | 828 | 0.2% |
| 65428 | 637 | 0.2% |
| 66333 | 349 | 0.1% |
| 38518 | 326 | 0.1% |
| 44145 | 323 | 0.1% |
| 32257 | 323 | 0.1% |
| 52525 | 314 | 0.1% |
| 78224 | 309 | 0.1% |
| 26789 | 301 | 0.1% |
| 48599 | 294 | 0.1% |
| Other values (8140) | 367524 |
| Value | Count | Frequency (%) |
| 1067 | 96 | |
| 1068 | 1 | < 0.1% |
| 1069 | 59 | |
| 1097 | 29 | < 0.1% |
| 1099 | 67 | |
| 1108 | 12 | < 0.1% |
| 1109 | 80 | |
| 1127 | 31 | < 0.1% |
| 1129 | 46 | |
| 1139 | 67 |
| Value | Count | Frequency (%) |
| 99998 | 16 | < 0.1% |
| 99996 | 3 | < 0.1% |
| 99994 | 7 | < 0.1% |
| 99991 | 2 | < 0.1% |
| 99988 | 9 | < 0.1% |
| 99986 | 19 | < 0.1% |
| 99976 | 37 | < 0.1% |
| 99974 | 159 | |
| 99958 | 9 | < 0.1% |
| 99955 | 23 | < 0.1% |
lastSeen
Date
| Distinct | 182806 |
|---|---|
| Distinct (%) | 49.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.8 MiB |
| Minimum | 2016-03-05 14:15:08 |
|---|---|
| Maximum | 2016-04-07 14:58:51 |
| dateCrawled | name | seller | offerType | price | abtest | vehicleType | yearOfRegistration | gearbox | powerPS | model | kilometer | monthOfRegistration | fuelType | brand | notRepairedDamage | dateCreated | nrOfPictures | postalCode | lastSeen | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2016-03-24 11:52:17 | Golf_3_1.6 | privat | Angebot | 480 | test | NaN | 1993 | manuell | 0 | golf | 150000 | 0 | benzin | volkswagen | NaN | 2016-03-24 00:00:00 | 0 | 70435 | 2016-04-07 03:16:57 |
| 1 | 2016-03-24 10:58:45 | A5_Sportback_2.7_Tdi | privat | Angebot | 18300 | test | coupe | 2011 | manuell | 190 | NaN | 125000 | 5 | diesel | audi | ja | 2016-03-24 00:00:00 | 0 | 66954 | 2016-04-07 01:46:50 |
| 2 | 2016-03-14 12:52:21 | Jeep_Grand_Cherokee_"Overland" | privat | Angebot | 9800 | test | suv | 2004 | automatik | 163 | grand | 125000 | 8 | diesel | jeep | NaN | 2016-03-14 00:00:00 | 0 | 90480 | 2016-04-05 12:47:46 |
| 3 | 2016-03-17 16:54:04 | GOLF_4_1_4__3TÜRER | privat | Angebot | 1500 | test | kleinwagen | 2001 | manuell | 75 | golf | 150000 | 6 | benzin | volkswagen | nein | 2016-03-17 00:00:00 | 0 | 91074 | 2016-03-17 17:40:17 |
| 4 | 2016-03-31 17:25:20 | Skoda_Fabia_1.4_TDI_PD_Classic | privat | Angebot | 3600 | test | kleinwagen | 2008 | manuell | 69 | fabia | 90000 | 7 | diesel | skoda | nein | 2016-03-31 00:00:00 | 0 | 60437 | 2016-04-06 10:17:21 |
| 5 | 2016-04-04 17:36:23 | BMW_316i___e36_Limousine___Bastlerfahrzeug__Export | privat | Angebot | 650 | test | limousine | 1995 | manuell | 102 | 3er | 150000 | 10 | benzin | bmw | ja | 2016-04-04 00:00:00 | 0 | 33775 | 2016-04-06 19:17:07 |
| 6 | 2016-04-01 20:48:51 | Peugeot_206_CC_110_Platinum | privat | Angebot | 2200 | test | cabrio | 2004 | manuell | 109 | 2_reihe | 150000 | 8 | benzin | peugeot | nein | 2016-04-01 00:00:00 | 0 | 67112 | 2016-04-05 18:18:39 |
| 7 | 2016-03-21 18:54:38 | VW_Derby_Bj_80__Scheunenfund | privat | Angebot | 0 | test | limousine | 1980 | manuell | 50 | andere | 40000 | 7 | benzin | volkswagen | nein | 2016-03-21 00:00:00 | 0 | 19348 | 2016-03-25 16:47:58 |
| 8 | 2016-04-04 23:42:13 | Ford_C___Max_Titanium_1_0_L_EcoBoost | privat | Angebot | 14500 | control | bus | 2014 | manuell | 125 | c_max | 30000 | 8 | benzin | ford | NaN | 2016-04-04 00:00:00 | 0 | 94505 | 2016-04-04 23:42:13 |
| 9 | 2016-03-17 10:53:50 | VW_Golf_4_5_tuerig_zu_verkaufen_mit_Anhaengerkupplung | privat | Angebot | 999 | test | kleinwagen | 1998 | manuell | 101 | golf | 150000 | 0 | NaN | volkswagen | NaN | 2016-03-17 00:00:00 | 0 | 27472 | 2016-03-31 17:17:06 |
| dateCrawled | name | seller | offerType | price | abtest | vehicleType | yearOfRegistration | gearbox | powerPS | model | kilometer | monthOfRegistration | fuelType | brand | notRepairedDamage | dateCreated | nrOfPictures | postalCode | lastSeen | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 371518 | 2016-04-02 20:37:03 | Bmw_320_D_DPF_Touring_!!! | privat | Angebot | 3999 | test | kombi | 2005 | manuell | 3 | 3er | 150000 | 5 | diesel | bmw | nein | 2016-04-02 00:00:00 | 0 | 81825 | 2016-04-06 20:47:12 |
| 371519 | 2016-03-09 13:37:43 | Alfa_Romeo_159_Jtdm_1.9_150_ps_13_600_km_top_voll | privat | Angebot | 5250 | control | NaN | 2016 | automatik | 150 | 159 | 150000 | 12 | NaN | alfa_romeo | nein | 2016-03-09 00:00:00 | 0 | 51371 | 2016-03-13 01:44:13 |
| 371520 | 2016-03-19 19:53:49 | turbo_defekt | privat | Angebot | 3200 | control | limousine | 2004 | manuell | 225 | leon | 150000 | 5 | benzin | seat | ja | 2016-03-19 00:00:00 | 0 | 96465 | 2016-03-19 20:44:43 |
| 371521 | 2016-03-27 20:36:20 | Opel_Zafira_1.6_Elegance_TÜV_12/16 | privat | Angebot | 1150 | control | bus | 2000 | manuell | 0 | zafira | 150000 | 3 | benzin | opel | nein | 2016-03-27 00:00:00 | 0 | 26624 | 2016-03-29 10:17:23 |
| 371522 | 2016-03-21 09:50:58 | Mitsubishi_Cold | privat | Angebot | 0 | control | NaN | 2005 | manuell | 0 | colt | 150000 | 7 | benzin | mitsubishi | ja | 2016-03-21 00:00:00 | 0 | 2694 | 2016-03-21 10:42:49 |
| 371523 | 2016-03-14 17:48:27 | Suche_t4___vito_ab_6_sitze | privat | Angebot | 2200 | test | NaN | 2005 | NaN | 0 | NaN | 20000 | 1 | NaN | sonstige_autos | NaN | 2016-03-14 00:00:00 | 0 | 39576 | 2016-04-06 00:46:52 |
| 371524 | 2016-03-05 19:56:21 | Smart_smart_leistungssteigerung_100ps | privat | Angebot | 1199 | test | cabrio | 2000 | automatik | 101 | fortwo | 125000 | 3 | benzin | smart | nein | 2016-03-05 00:00:00 | 0 | 26135 | 2016-03-11 18:17:12 |
| 371525 | 2016-03-19 18:57:12 | Volkswagen_Multivan_T4_TDI_7DC_UY2 | privat | Angebot | 9200 | test | bus | 1996 | manuell | 102 | transporter | 150000 | 3 | diesel | volkswagen | nein | 2016-03-19 00:00:00 | 0 | 87439 | 2016-04-07 07:15:26 |
| 371526 | 2016-03-20 19:41:08 | VW_Golf_Kombi_1_9l_TDI | privat | Angebot | 3400 | test | kombi | 2002 | manuell | 100 | golf | 150000 | 6 | diesel | volkswagen | NaN | 2016-03-20 00:00:00 | 0 | 40764 | 2016-03-24 12:45:21 |
| 371527 | 2016-03-07 19:39:19 | BMW_M135i_vollausgestattet_NP_52.720____Euro | privat | Angebot | 28990 | control | limousine | 2013 | manuell | 320 | m_reihe | 50000 | 8 | benzin | bmw | nein | 2016-03-07 00:00:00 | 0 | 73326 | 2016-03-22 03:17:10 |
Most frequently occurring
| dateCrawled | name | seller | offerType | price | abtest | vehicleType | yearOfRegistration | gearbox | powerPS | model | kilometer | monthOfRegistration | fuelType | brand | notRepairedDamage | dateCreated | nrOfPictures | postalCode | lastSeen | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2016-03-08 18:42:48 | Mercedes_Benz_CLK_Coupe_230_Kompressor_Sport | privat | Angebot | 1799 | test | coupe | 1999 | automatik | 193 | clk | 20000 | 7 | benzin | mercedes_benz | nein | 2016-03-08 00:00:00 | 0 | 89518 | 2016-03-09 09:46:57 | 2 |
| 1 | 2016-03-18 18:46:15 | Volkswagen_Passat_Variant_1.9_TDI_Highline | privat | Angebot | 1999 | control | kombi | 2001 | manuell | 131 | passat | 150000 | 7 | diesel | volkswagen | nein | 2016-03-18 00:00:00 | 0 | 36391 | 2016-03-18 18:46:15 | 2 |
| 2 | 2016-03-28 00:56:10 | Suzuki_Ignis | privat | Angebot | 1000 | control | kleinwagen | 2002 | manuell | 83 | andere | 150000 | 1 | benzin | suzuki | nein | 2016-03-28 00:00:00 | 0 | 66589 | 2016-03-28 08:46:21 | 2 |
| 3 | 2016-04-03 09:01:15 | Mercedes_Benz_CLK_320_W209 | privat | Angebot | 4699 | test | coupe | 2003 | automatik | 218 | clk | 125000 | 6 | benzin | mercedes_benz | ja | 2016-04-03 00:00:00 | 0 | 75196 | 2016-04-07 09:44:54 | 2 |